# XLSR-53 Fine-tuning
Exp W2v2t Ja Xlsr 53 S109
Apache-2.0
Japanese automatic speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, trained using Common Voice 7.0 Japanese dataset
Speech Recognition
Transformers Japanese

E
jonatasgrosman
20
0
Ai Light Dance Singing Ft Wav2vec2 Large Xlsr 53 5gram V1
This model is an automatic speech recognition model based on wav2vec2-large-xlsr-53, fine-tuned on the GARY109/AI_LIGHT_DANCE - ONSET-SINGING dataset, primarily used for singing voice recognition.
Speech Recognition
Transformers

A
gary109
18
1
Wav2vec2 Common Voice Tr Demo Dist
Apache-2.0
This model is an automatic speech recognition (ASR) model fine-tuned on the COMMON_VOICE - TR Turkish dataset based on facebook/wav2vec2-large-xlsr-53, achieving a word error rate of 0.3242 on the evaluation set.
Speech Recognition
Transformers Other

W
cromz22
26
0
Wav2vec2 Large Xlsr 53 Japanese
Apache-2.0
Japanese speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampling rate audio input
Speech Recognition Japanese
W
jonatasgrosman
2.9M
33
Wav2vec2 Luganda
Apache-2.0
A Luganda automatic speech recognition system fine-tuned from Facebook's wav2vec2-large-xlsr-53 model, achieving 7.53% WER on the Common Voice Luganda dataset.
Speech Recognition
Transformers Other

W
indonesian-nlp
52
2
Wav2vec2 Large Xlsr Arabic
Apache-2.0
A Wav2Vec2-Large-XLSR-53 model fine-tuned for Arabic speech recognition, trained on the Common Voice and Arabic Speech Corpus datasets
Speech Recognition Arabic
W
mohammed
51
3
Wav2vec2 Common Voice Tr Demo
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned on the COMMON_VOICE - TR Turkish dataset based on the facebook/wav2vec2-large-xlsr-53 model.
Speech Recognition
Transformers Other

W
shiyue
25
0
Wav2vec2 Large Xlsr Gu
Apache-2.0
Gujarati automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, achieving 23.55% WER on OpenSLR dataset
Speech Recognition Other
W
gchhablani
3,582
0
Wav2vec2 Large Xlsr 53 Chinese Zh Cn
Apache-2.0
A Chinese speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampling rate audio input.
Speech Recognition Chinese
W
jonatasgrosman
3.8M
110
Wav2vec2 Large Xlsr Bengali
A Bengali automatic speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, trained with 40,000 speech samples from the OpenSLR dataset
Speech Recognition Other
W
arijitx
758
6
Wav2vec2 Hausa2 Demo Colab
Apache-2.0
This model is a Hausa speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition
Transformers

W
Arnold
19
1
Featured Recommended AI Models